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Abstract 

In this paper we obtain closed expressions for the probability distri¬ 
bution function, when we consider aggregated risks with multivariate 
dependent Pareto distributions. We work with the dependent mul¬ 
tivariate Pareto type II proposed by Arnold (1983, 2015), which is 
widely used in insurance and risk analysis. We begin with the indi¬ 
vidual risk model, where we obtain the probability density function 
(PDF), which corresponds to a second kind beta distribution. We 
obtain several risk measures including the VaR, TVaR and other tail 
measures. Then, we consider collective risk model based on depen¬ 
dence, where several general properties are studied. We study in detail 
some relevant collective models with Poisson, negative binomial and 
logarithmic distributions as primary distributions. In the collective 
Pareto-Poisson model, the PDF is a function of the Kummer conflu¬ 
ent hypergeometric function, and in the Pareto-negative binomial is 
a function of the Gauss hypergeometric function. Using the data set 
based on one-year vehicle insurance policies taken out in 2004-2005 
(Jong and Heller, 2008), we conclude that our collective dependent 
models outperform the classical collective models Poisson-exponential 
and geometric-exponential in terms of the AIC and CAIC statistics. 
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1 Introduction 


In the context of the classical theory of risk, consider the individual and the 
collective risk models (see Kass et al. (2001) and Kingman et ah (2008)). 
These models rely on the assumptions of the independence between: (i) 
different claim amounts; (ii) the number of claims and claim amounts and 
(iii) claim amounts and interclaim times. 

The use of the independence hypothesis facilitates the computation of 
many risks measures, but can be restrictive in different contexts. The need 
for generalizations in both individual and collective classical models (by con¬ 
sidering some kind of dependence structure) has led several researches to 
model dependence. 

In this context, Sarabia and Guillen (2008) have considered extensions of 
the classical collective model assuming that the conditional distributions S\N 
and N\S belong to some prescribed parametric family, where S is the total 
claim amount and N is the number of claims. Gomez-Deniz and Galderin- 
Ojeda (2014), using conditional specihcation techniques, have obtained dis¬ 
crete distributions to be used in the collective risk model to compute the 
right-tail probability of the aggregate claims size distribution. 

Albrecher and Teugels (2006) have considered a dependence structure 
based on a copula for the interclaim time and the subsequent claim size. 
Boudreault et al. (2006) studied an extension of the classical compound 
Poisson risk model, where the distribution of the next claim amount was 
dehned as a function of the time elapsed since the last claim. Gossette et al. 
(2008) have considered another extension introducing a dependence struc¬ 
ture between the claim amounts and the interclaim time using a generalized 
Farlie-Gumbel-Morgenstern copula. Gossette et al. (2004) have considered 
a variation of the compound binomial model in a markovian environment, 
which is an extension of the model presented by Gerber (1998). Gompound 
Poisson approximations for individual models with dependent risk was con¬ 
sidered by Genest et al (2003). 

A recent model of dependence between risks has been proposed by Gos¬ 
sette et al. (2013). These authors consider a portfolio of dependent risks 
whose multivariate distribution is dehned with the Farlie-Gumbel-Morgenstern 
copula, with mixed Erlang distribution marginals. 

In this paper we obtain closed expressions for the probability distribution 
function modeling aggregated risks with multivariate dependent Pareto dis¬ 
tributions between the different claim amounts. We work with the dependent 


2 



multivariate Pareto type II proposed by Arnold (1983, 2015), which is widely 
used in insurance and risk analysis. We begin with the classical individual 
risk model. For this model we obtain the probability density function (PDF), 
which corresponds to a second kind beta distribution. We also obtain sev¬ 
eral risk measures including the VaR, TVaR and other tail measures. Then, 
we consider collective risk model based on dependence, whose main general 
properties are studied. We study in detail some relevant collective models 
with Poisson, negative binomial and logarithmic distributions as primary dis¬ 
tributions. For these three models, we obtain simple and closed expressions 
for the aggregated distributions. 

The contents of this paper are the following. In Section [2] we present the 
main univariate distributions used in the paper. The class of multivariate 
dependent Pareto distribution considered to model aggregated risks is pre¬ 
sented in |3l Section 0] presents the individual risk model under dependence 
and Section [5] introduces the collective risk model under dependence. After 
presenting general results we study the compound models where the primary 
distribution is Poisson, negative binomial, geometric and logarithmic and 
the secondary distribution is Pareto. Section [H] includes an example with 
real data. The conclusions of the paper are given in Section [71 

2 Univariate distributions 

In this section, we introduce several univariate random variables which will 
be used in the paper. 

We work with the Pareto distribution with PDF given by, 

OL 

and f{x;a,(3) = 0 if x < 0, where a,(3 > 0. Here, a is a shape parameter 
and /9 is a scale parameter. We represent X ~ Va{a,(3). 

We denote by X ~ Qa{a) a gamma random variable with PDF /(x) = 
^-p{a )— if 3; > 0, whith a > 0. The exponential distribution with mathemat¬ 
ical expectation 1 is denoted by Qa{l). 

The following lemma provides a simple stochastic representation of the 
Pareto distribution as quotient of random variables. The proof is straight¬ 
forward and will be omitted. 
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Lemma 1 Let Ui and Ua independent gamma random variables such that 
Ui ~ Qa{l) and Ua ~ Ga{a), where a > 0. If /3 > 0, the random variable, 


U a 


( 2 ) 


An extension of the Pareto distribution ([T]) is the following. A random 
variable X is said to be a second kind beta distribution if its PDF is of the 
form, 


f{x;p,q,P) 


^P-i 

(3pB{p, q){l + x/ (3)p+‘i ’ 


X > 0, 


(3) 


and f{x;p, g, /?) = 0 if x < 0, where p,q, (I > 0 and B{p, q) = denotes 

the beta function. This random variable corresponds to the Pearson VI 
distribution in the classical Pearson systems of distributions and we represent 
X ~ B2{p,q,\). If we set p = 1 in Q, we obtain a Pareto distribution 
Va{q,l3) like ([1]). 

The second kind beta distribution has a simple stochastic representation 
as a ratio of gamma random variables. If Up and Uq are independent gamma 
random variables, the new random variable X = (3^ has the PDF dehned 
in (ED. 


3 The multivariate Pareto class 

Now we present the class of multivariate dependent Pareto distribution which 
will be used in the different models. 

In the literature several classes of multivariate Pareto distributions have 
been proposed. One of the main classes was introduced by Arnold (1983, 
2015), in the context of the hierarchy Pareto distributions proposed by this 
author. Other classes were proposed by Chiragiev and Landsman (2009) and 
Asimit et ah (2010). The conditional dependence structure is the base of the 
construction of the proposals by Arnold (1987) and Arnold et ah (1993) (see 
also Arnold et al, 2001), where two different dependent classes are obtained. 

Definition 1 Let Yi,Y 2 ,... and Ya be mutually independent gamma ran¬ 
dom variables with distributions Yi ~ Ga{l), i = 1,2,... ,n and Ya ~ Ga{a) 
with a > 0. The multivariate dependent Pareto distribution is defined by the 
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stochastic representation, 


ry rv ^ rv 


( 4 ) 


where /3 > 0. 

Note that the common random variable introduces the dependence in 
the model. 


3.1 Properties of the multivariate Pareto class 

We describe several properties of the multivariate Pareto dehned in ([T]). 

• Marginal distributions. By construction, the marginal distributions are 
Pareto, 


Xi ~ Va{a, l3), i = 1, 2,..., n. 
The joint PDF of the vector X is given by, 

T{a + n) 1 


T{a)/3^ (l + Er=iW/9)“+” 


, Xi,. . . ,Xn> 0. 


( 5 ) 

This expression corresponds to the multivariate Pareto type II proposed 
by Arnold (1983, 2015). 


• The covariance matrix is given by, 

cov{Xi, Xj) = - ---—, a > 2, I^J 

[a — l)^{a — 2) 

and the correlation between components is, 

^j) — ) cr > 2, i ^ j 

a 

• General moments. The moments of ([1]) are, 

E[X" ■.. x:-] = ^nr(i + r.), 

r(a) 

where A = ri + • • • + and a > A. 
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The dependence structure of X is studied in the following result 

Proposition 1 The random variables X = (Xi,..., are associated, 

and then cov{Xi, Xj) >0, if i ^ j■ 

Proof: See Appendix. I 

Remark: Let us consider the multivariate Pareto survival function of ([5]) 
given by, 


with a, ft > 0. For this family, the associated copula is the Pareto copula or 
Clayton copula, 

C{Ui, . . . ,Un]o) = H-h - U + 1 j 

Note that the dependence increases with a, being the independence case 
obtained when a —)■ 0 and the Frechet upper bound when a —)■ cxd . 


F{xi, ...,Xn)= 1 + 



Xi,... > 0, 


4 The individual risk model under Pareto de¬ 
pendence 


In this section we consider the individual risk model assuming dependence be¬ 
tween risks. The distribution of sums of iid Pareto distributions was obtained 
by Ramsay (2006). Let (Xi,... ,X„)''" be the multivariate Pareto distribu¬ 
tion dehned in (|1]). Then, we consider the aggregate risks Sn = Xi + - ■ ■ + Xn. 
We have the following result. 


Theorem 1 The PDF of the aggregate random variable Sn, where the com¬ 
ponents are Pareto defined in m is given by. 


fsSFn,a,(3) 


^n— 1 


a; > 0 


end fs„{,x; n,a, (d) = 0 if x < 0, that is Sn ~ B2{n, a, (5). 


Proof: See Appendix. I 

Note that ([1]) corresponds to the PDF of a second kind beta distribution 
dehned in ([3]). Figure [1] represents the PDF ([1]) for a = 1/2; 1;2 and 10 for 
n = 2,5, 10 and 20. 
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Figure 1: PDF of the individual model Sn for some selected values of a and 
n. 


4.1 Risk measures 


Here we present some risk measures for the second kind beta distribution, 
which can be applied for the aggregate PDF given in ([1]). The value at risk 
VaR at level n, with 0 < n < 1 of a random variable X with CDF F{x) is 
dehned as, 

VaR[X;M] = inf{a: G M, F{x) > u}. 

If V ~ H2(p, g,/9), Guillen et ah (2013) have obtained that. 


VaR[X;M] = (3 


IB ^{u]p,q) 

1 - IB-^{u-p,qy 


( 6 ) 


where IB q) denotes the inverse of the incomplete ratio beta function, 

which corresponds to the quantile function of the classical beta distribution 
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of the first kind. Then, using (E]) for the aggregate distribution ([T]) we have 


VaR[Sn; u] = ^ 


IB ^(M;n,a) 

1 — n, a) 




with 0 < M < 1. 

The following result provides higher moments of the worst events, 
which extend popular risk measures. 

Lemma 2 Let X ~ B2{p,q,(3) be a second kind beta distribution. Then, the 
conditional tail moments are given by, 


E[X^\X > Xu] 


/3'T(p + r)r(g — r) 

(i-w)r(p)r(g) 


l-B 


Xu//3 
1 + Xu/fd 


■,p + r,q — r 


where Xu = VaR[X]u] represents the value at risk with u G (0,1). 



Proof: See Appendix. I 

If we take r = 1 in ([7]) we obtain the tail value at risk TVaR, which was 
obtained by Guillen et al. (2013). 


5 The collective risk model under dependence 

In this section we consider the collective model based on dependence between 
claim amounts. Let N be the number of claims in a portfolio of policies in 
a time period. Let Xj, i = 1,2,... be the amount of the ith claim and 
S'tv = XiH— ■ + Xjq the aggregate claims of the portfolio in the time period 
considered. 

5.1 General properties 

We consider two assumptions: (1) We assume that all the claims Xi, X 2 ,..., X„, 
are dependent random variables with the same distribution and (2) The ran¬ 
dom variable N is independent of all claims Xi, X 2 ,..., X„,... 

Theorem 2 Let (Xi, X 2 ,..., X„,...) be dependent random variables with 
common CDF F{x), and let N be the observed number of claims, with PMF 





given by Pn = Pr(iV = n), for n = 0,1,..., which is independent of all Xi ’s, 
i = 1,2,... Then, the CDF of the aggregate losses Sjy is, 

OO 

= '^PnFPix), 

n=0 

where F^\x) represent the CDF of the convolution of the n dependent claims 
iX,,...,X^). 

Proof: See Appendix. I 

The mean and the variance of the collective model can be found in the 
following result. 

Lemma 3 The mean and the variance of Sn under dependence are given 

by, 

E{Sn) = E{N)E{X), (8) 

var{SN) = E{N)var{X)+var{N){E{X)f+E[N{N-l)]cov{Xi,Xj) (9) 

Proof: See Appendix. I 

If the claims Xi and Xj are independent cov{Xi, Xj) = 0 and then fl9l) 
becomes in the usual formula for nar(S'Ar). 

On the other hand, if the random variables {Xi\ are associated, 

var(S'^^) < var(S'jf^), 


where var(S'n^^) is the variance in the independent case and var(S'i^^) the 
variance in the dependent case. This fact is a consequence of the associated 
property, which leads to a positive variance between Xi and Xj. 

5.2 Compound Pareto models 

In this section we obtain the distribution of the compound collective model 
Sn = Xi + X 2 + ■ ■ ■ + X]y where the secondary distribution is Pareto given 
by dl]), and several primary distribution are considered for N. 

In the case of independence between claims Xi this distribution was stud¬ 
ied by Ramsay (2009) when the primary distribution is Poisson and negative 
binomial cases. 
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5.2.1 The compound Pareto-Poisson distribution 

We consider the model where the primary distribution is a Poisson distribu¬ 
tion. 


Theorem 3 If we assume a Poisson distribution with parameter A as pri¬ 
mary distribution and (Xi,..., X„) is defined in (Ifi, the PDF of the random 
variable Sn is given by, 


fs^{x;a,X,/3) 


aXe ^ ^ 

/3{l + x//3)<^+^^ ' 


1 -p o; 2; 


Xx/fi 
1 + x/fi 


X > 0 (10) 


and /sjy (0; a, A,/9) = e ^ where iFfia^h^z) denotes the Rummer confluent 
hypergeometric function defined by, 


iFi[a;6;z] = ^ 
n=0 


jajnZ'^ 
{b)nn\ ’ 


where {a)n represent the Pochhammer symbol defined by (a)„ = a(a—1)... (a— 
n + 1). 

Proof: See Appendix. I 

Figure [2] represents the PDF ffTOj) Pareto-Poisson for some values of a = 
0, 5; 1; 2 and 10 and A = 2, 5,10 and 20, taking fi = 1. 

Using formulas (jS]) and (jH]), the mean and variance of the dependent 
Pareto-Poisson collective model are given by, 


E{Sn) 

var{S]y) 


A/3 

a — 1 ’ 


a > 1, 


a/3^{X + 2a-2) 


{a-iy{a-2) ’ 


a > 2. 


5.2.2 The compound Pareto-negative binomial distribution 

Let X be a negative binomial distribution with PMF, 

T^(<n - 1 - 7 ^^ 

Pr(X = n) = ^^/(l - pY, n = 0,1, 2,... (11) 

We have the following theorem. 


10 











0 5 10 15 20 25 30 0 2 4 6 8 10 


Figure 2: PDF of the collective model Pareto-Poisson for (3 = 1 and selected 
values of a and A. 


Theorem 4 If we assume a negative binomial distribution with parameter r 
and p and PMF given by / fTTl) as primary distribution and (Xi,... ,X„) is 
defined in the PDF of the random variable Sn is given by, 


fs^{x]a,fi,r,p) 


ra{l — p)p'^ ^ 

/3{l + x//3r+^^ ' 


1 + r, 1 + tt; 2; 


{l-p)x//3 
1 + x/f3 


X > 0 


and fsj^{0;a,(3,r,p) = p^, where 2 Fi{a,b;c] z) denotes the Gauss hypergeo¬ 
metric function defined by, 


2 Fi[a,b-,c;z] = ^ 

n=0 


{a)n{b)nZ'^ 

{c)nn\ 


where (a)„ denotes the Pochhammer symbol. 
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The proof is similar to the proof in Theorem [3] and will be omitted. 
Again, using formulas (|H]) and ([3]), the mean and variance of the dependent 
Pareto-negative binomial collective model are given by, 


E{Sr,) 


nar(S'Ar) 


r(l — p)P 


a > 1, 


p{a - 1) ’ 
r(l -p)f3^{{l +p){a 


p^{a — lY{a 


1) r(l - p)) 


a > 2. 


5.2.3 The compound Pareto-geometric distribution 

In this result we obtain the PDF of the Pareto-geometric distribution. 

Corollary 1 If we assume a geometric distribution with parameter and PMF 
given by Pr(iV = n) = p{l — p)^, n = 0,1,... as primary distribution and 
(Xi,... ,Xn) is defined in the PDF of the random variable Sn is given 

by, 

fs^{x]a,fi,p) = ^>0 (12) 

and fsj^i0;a,l3,p) = p. 

Figure [3] represents the PDF (IT^ for some selected values of a and p 

W = !)• 


5.2.4 The compound Pareto-logarithmic distribution 


Let X be a discrete logarithmic distribution with PMF, 

1 9 ^ 

where 6 G (0,1). 

We have the following theorem. 


Theorem 5 If we assume a logarithmic distribution with parameter 9 and 
PMF given by / flgl) as primary distribution and (Xi,... ,X„) is defined in 
m, the PDF of the random variable Sn is given by, 


fs^{x-,a,fi,9) 


log(l — 6^) \_x{l + (1 — 6)x/fiY x(lPx/l3Y 


a; > 0 
(14) 


and fsNiD 1^, ^) = 0, if x < 0. 
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Figure 3: PDF of the collective model Pareto-Geometric for j3 = 1 and 
selected values of a and p. 


The proof of this result is omitted. 

Figure m represents the PDF flTTl) Pareto-Logarithmic for some values of 
a = 0, 5; 1; 2 and 10 and 9 = 0.2; 0.4; 0.6 and 0.8 taking = 1. 

The mean and variance of the dependent Pareto-Logarithmic collective 
model are 


E{Sr,) 


var^Sisf) 


afid 


a > 1, 


[a 


a/3‘^6{2{a — 1) — a9{l -|- a) -|- 6{1 + 2a) 
(«-l)2(a-2)(l-0)2 ' 


a > 2, 


being a = —1/ log(l — 9). 
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Figure 4: PDF of the collective model Pareto-Logarithmic for (3 = 1 and 
selected values of a and 6. 


6 A Numerical application with real data 

In order to compare the performance of the models presented in this paper, 
we examined a data set based on one-year vehicle insurance policies taken 
out in 2004 or 2005. This data set is available on the website of the Faculty 
of Business and Economics, Macquarie University (Sydney, Australia) (see 
also Jong and Heller (2008)). The hrst 100 observations of this data set 
are shown in Table [H with the following elements: from left to right, the 
policy number, the number of claims and the size of the claims. The total 
portfolio contains 67856 policies of which 4624 have at least one claim. Some 
descriptive statistics for this data set are shown in Table [2J It can be seen that 
the standard deviation is very large for the size of the claims, which means 
that a premium based only on the mean size of the claims is not adequate for 
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computing the bonus-malus premiums. The covariance between the claims 
and sizes is positive and takes the value 141.574. 


Table 1: First 100 observations of the data set 
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Figure |6] shows the complete number of claims and the total claim amount 
concerning these claims. It can be seen that the largest claim values appear 
in the case of single claims, while these values fall with larger numbers of 
claims. 

In this regard, the compound Poisson model has been traditionally con¬ 
sidered when the size of a single claim is modeled by an exponential distri¬ 
bution, chiefly because of the complexity of the collective risk model under 
other probability distributions such as Pareto and log-normal distributions. 

Perhaps the most well-known aggregate claims model is the obtained 
when the primary and secondary distribution are the Poisson and the ex¬ 
ponential distributions, respectively. In this case (see Rolski et ah (1999), 
among others) the distribution of the random variable total claim amount is 
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Figure 5: Number of claims and their sizes 


given by 


fsix-,a,X) 



exp(—A — ax)Ii ( 2 V Aaxj 


, X > 0, 


while /5(0;a,A) = exp(—A). Here, A > 0 and a > 0 are the parameters of 
the Poisson and exponential distribntions, respectively and 




(V2) 


2k+u 


k=0 


T{k + l)T{u + k + 1) 


, z G M, r' G 


(15) 


represents the modihed Bessel fnnction of the hrst kind. 

Additionally, the negative binomial distribntion with parameters r > 0 
and 0 < p < 1 conld also be assnmed as primary distribntion and the ex¬ 
ponential distribntion as the secondary distribntion. In this case, the PDF 
of the random variable total claim amonnt (see Rolski et ah (1999)) is now 
given by the expression 


/s(x; a, r,p) = arp^{l — p) exp(—ax)iFi[l -|- r; 2; a(l — p)x], x > 0, 
where iFi[-; •; ■] is the continent hypergeometric fnnction and fs{0;a,r,p) = 

p"^. 
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Table 2: Some descriptive data of claims and claim size for the data set 



Number of 

Total claim 


claims 

amount 

Mean 

0.072 

137.27 

Standard deviation 

0.278 

1056.30 

min 

0 

0 

max 

4 

55922.10 


When r = 1 in ([2]) we get the PDF of the total claim amonnt when the 
geometric distribution is considered as primary distribution and the expo¬ 
nential distribution as secondary. This results in 

fs{x;a,p)=ap{l—p)exp{—apx), x > 0, 
with fs{0;a,p) = p. 

Moment estimators can be obtained by equating the sample moments 
to the population moments. Furthermore, the parameters of the different 
models of the total claim amount can be estimated via maximum likelihood. 
We only present the case of Pareto-geometric case. To do so, consider a 
random sample {xi, X 2 ,..., x„}. The log-likelihood function can be written 
as 


i{a,(3,p) = no logp-F (n - no) [logo + logp 

+ log(l -p) + alog/3] - (a + 1) ^log(/3 + pXi), 

Xi>0 


where no is the number of zero-observations and n — tiq is the number of 
non-zero sample observations, where n is the sample size. 

The equations from which we get the maximum likelihood estimates can¬ 
not be solved explicitly. They must be solved either by numerical methods or 
by directly maximizing the log-likelihood function. This was the method car¬ 
ried out here. Since the global maximum of the log-likelihood surface is not 
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guaranteed, different initial values in the parameter space were considered 
as a seed point. We use the FindMaximum function of Mathematica soft¬ 
ware package v.10.0 (Wolfram (2003)). Different maximization algorithms 
such as Newton, PrincipalAxis and QuasiNewton were used to ensure that 
the same estimates are obtained. Finally, the variances of the maximum 
likelihood estimates can be estimated by the diagonal elements of the inverse 
matrix of negative second derivatives of the log-likelihood function, evaluated 
at the maximum likelihood estimates. When hypergeometric functions ap¬ 
pear, these were replaced by their series representation by taking one hundred 
terms in the sum. This facilitates the computation of the Hessian matrix. 

A summary of the results obtained is shown in Table O In this Table the 
estimated parameter values are presented together with their standard errors 
in parenthesis, the AIC and the CAIC. Bozdogan (1987) proposed a corrected 
version of AIC in an attempt to overcome the tendency of the AIC to over¬ 
estimate the complexity of the underlying model. Bozdogan (1987) observed 
that Akaike Information Criteria (AIC) (see Akaike (1973)) does not directly 
depend on sample size and as a result lacks certain properties of asymptotic 
consistency. In formulating CAIC, a correction factor based on the sample 
size is employed to compensate for the overestimating nature of AIC. The 
CAIC is dehned as CAIC = —21og£ -f (1 -|- logn) k, where again i and k 
refers to the likelihood under the htted model and the number of parame¬ 
ters, respectively and n is the sample size. As we can see, AIC differs from 
CAIC in the second term which now takes into account the sample size n. 
Again, models that minimize the Consistent Akaike Information Criteria are 
selected. Our results point out that the compound Pareto model outperforms 
the Poisson-exponential and the geometric-exponential models, which have 
been widely considered in the actuarial literature when parametric models 
are used. 
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Table 3: Summary of results for the different models considered 


Distribution 

Primary 

Secondary 

r 

P 

A 

a 


AIG 

GAIG 

Poisson 

Exponential 



0.12057 

0.87832 


51402.60 

51422.80 





(0.00104) 

(0.00759) 




Geometric 

Exponential 


0.93186 

0.53273 



49495.40 

49515.60 




(0.00097) 

(0.00785) 





Negative Binomial 

Exponential 

0.51168 

0.87090 


0.55250 


49487.20 

49517.60 



(0.00000) 

(0.0000) 


(0.00471) 




Poisson 

Pareto 



0.07058 

2.04828 

2.13071 

48229.50 

48259.90 





(0.00102) 

(0.00974) 

(0.04879) 



Geometric 

Pareto 


0.93186 


2.04655 

2.05481 

48229.60 

48260.00 




(0.00097) 


(0.08828) 

(0.12407) 



Negative Binomial 

Pareto 

0.31749 

0.80067 


2.05542 

1.91539 

48232.10 

48272.60 



(0.21505) 

(0.12052) 


(0.09000) 

(0.17324) 





Figure E] shows the PDF of the four models considered using the param¬ 
eter estimated given in Table El As we can see,the new compound models 
have a larger right tail than the traditional models based on the use of the 
exponential distribution. 


Primary distribution: Poisson Primary distribution; Geometric 




Figure 6: PDF of the compound distributions for the estimated parameters 
of the models 


Parameter estimates presented in Table [3] have been used to calculate 
the right-tail cumulative probabilities for different values of x as displayed in 
Table El As it can be inferred from this Table, for values of x < 5 the com¬ 
pound exponential-Poisson and exponential-geometric models has a slightly 
better performance than the new compound models in terms of the decreas¬ 
ing probabilities they generate. Nevertheless, the opposite result is obtained 
for X > 5. Furthermore, new compound models tend to zero slower than the 
models based on the exponential secondary distribution. 
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Table 4: Right-tail cumulative probabilities of the aggregate claim distribution for the estimated parameters 
of the models 


Primary distribution: Poisson Primary distribution: Geometric Primary distribution: Neg. Bin. 



Secondary distribution 

Secondary distribution 

Secondary distribution 

X 

Exponential 

Pareto 

Exponential 

Pareto 

Exponential 

Pareto 

1 

0.0496829 

0.0317014 

0.0414796 

0.0316985 

0.0415048 

0.0317054 

2 

0.0217140 

0.0181808 

0.0252488 

0.0181835 

0.0252360 

0.0181934 

3 

0.0094826 

0.0117457 

0.0153690 

0.0117506 

0.0153486 

0.0117580 

4 

0.0041379 

0.0081956 

0.0093551 

0.0082009 

0.0093377 

0.0082053 

5 

0.0018043 

0.0060350 

0.0056945 

0.0060403 

0.0056824 

0.0060423 

6 

0.0007862 

0.0046246 

0.0034662 

0.0046296 

0.0034589 

0.0046298 

7 

0.0003423 

0.0036541 

0.0021099 

0.0036587 

0.0021060 

0.0036577 

8 

0.0001490 

0.0029583 

0.0012843 

0.0029625 

0.0012826 

0.0029607 

9 

0.0000648 

0.0024428 

0.0007817 

0.0024467 

0.0007814 

0.0024443 

10 

0.0000281 

0.0020504 

0.0004758 

0.0020540 

0.0004761 

0.0020513 

11 

0.0000122 

0.0017450 

0.0002896 

0.0017482 

0.0002902 

0.0017453 

12 

5.3132E-6 

0.0015026 

0.0001763 

0.0015056 

0.0001769 

0.0015026 

13 

2.3057E-6 

0.0013072 

0.0001073 

0.0013099 

0.0001078 

0.0013069 

14 

l.OOOOE-6 

0.0011473 

0.0000653 

0.0011498 

0.0000658 

0.0011468 

15 

4.3357E-7 

0.0010149 

0.0000397 

0.0010172 

0.0000401 

0.0010142 

16 

1.8787E-7 

0.0009040 

0.0000242 

0.0009061 

0.0000244 

0.0009031 

17 

8.1375E-8 

0.0008102 

0.0000147 

0.0008122 

0.0000149 

0.0008093 

18 

3.5230E-8 

0.0007302 

8.9686E-6 

0.0007321 

9.1270E-6 

0.0007292 

19 

1.5246E-8 

0.0006614 

5.4592E-6 

0.0006631 

5.5729E-6 

0.0006603 

20 

6.5952E-9 

0.0006018 

3.3230E-6 

0.0006035 

3.4034E-6 

0.0006007 




7 Conclusions 


We have obtained closed expressions for the probabilistic distribntion and 
several risk measnres, modeling aggregated risks with multivariate dependent 
Pareto distributions. 

We have worked with the dependent multivariate Pareto type II. For the 
case of the individual risk model, we have obtained the PDF of the aggre¬ 
gated risks, which corresponds to a second kind beta distribution. Then, 
we have considered the collective risk model based on dependence. We have 
studied some relevant collective models with Poisson, negative binomial and 
logarithmic distributions as primary distributions. For the collective Pareto- 
Poisson model, the PDF is a function of the Kummer confluent hypergeomet¬ 
ric function, and in the Pareto-negative binomial is a function of the Gauss 
hypergeometric function. 

Finally, using the data set based on one-year vehicle insurance policies 
taken out in 2004-2005 (Jong and Heller, 2008), we have concluded that 
our collective dependent models outperform the classical collective models 
dehned by the Poisson-exponential and the geometric-exponential distribu¬ 
tions in terms of the AIC and CAIC statistics. 
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Appendix 

Proof of[T]The proof is based on the fact that the random variables (Xi,..., 
are increasing functions of independent random variables and as a conse¬ 
quence they are associated random variables (Esary et al. 1967). 

Proof of Theorem [1] We have, 

= w + ■ • ■ + 
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and the distribution of the numerator is a Qa{n) and the denominator is 
Qa{a), where both random variables are independent. Consequently, the ra¬ 
tio is a second kind beta distribution with PDF given by (IT]) . 

Proof of Theorem [2] The proof is direct, 

Fs,{x) = Pt{Sn<x) 

oo 

= <x\N = n)PT{N = n) 

n=0 

oo 

71=0 

where now [x) represents the CDF of the convolution of the n dependent 
claims (Xi,..., X„). 


Proof of Lemma [2] The tail moments can be expressed as. 


E[X^\X > a] 


STx'dFxjx) 
1 - Fx{a) 


where a = VaR[X;M]. Using the incomplete distribution of the second kind 
beta distribution we obtain the result. 


Proof of Lemma [3] The formula for E{Sn) is direct. Formula for varlSx) 
can be obtain using the identity var{Sx) = E[var[SN\N]] + var[E[SN\N]]. 


Proof of Theorem [3] If a; = 0, /s^(0) = Pr(X = 0) = e If x > 0 and 

\xfl3 
1+X//3'’ 


calling z = 


fsNix) = ^fP{x)PT{N = n) 


n=l 

oo 


X 


n—1 


e-^X 


^/3’"R(n,a)(l+ x//5)’"+“ n! 

P(n -|- a) / \x/{3 


X e 




P(a)(l-1-x//5)" “ P(n)n! \l + x/(3 
-zP(o -|- l)iFi[l -f- o; 2; z], 


X ^ 


P(a)(l x//3y 
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and we obtain (flU]) . 


Proof of Corollary [T] The proof is direct nsing Theorem 0] and taking into 
acconnt that, 

2F,[2,l + a;2;z] = {1 - z)-'^-K 
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